Skip to content

[AMD] chore: add temp mi355x runners and disable mi355x nodes 0-3 temporarily#203

Merged
cquil11 merged 1 commit into
mainfrom
mi355x-updates
Nov 10, 2025
Merged

[AMD] chore: add temp mi355x runners and disable mi355x nodes 0-3 temporarily#203
cquil11 merged 1 commit into
mainfrom
mi355x-updates

Conversation

@cquil11
Copy link
Copy Markdown
Collaborator

@cquil11 cquil11 commented Nov 9, 2025

@japarada @ChrisMasonAMD @araslanix

Nodes mi355x-amd_{0,1,2,3} are being validated. Two new MI355X nodes have been supplied for the time being.

  • Setup mi355x-amd_4 as GPU3DBD
  • Setup mi355x-amd_5 as GPU3D32
  • Disable mi355x-amd_{0,1,2,3}

Run the following for verification that runs all configs on both new nodes: https://github.com/InferenceMAX/InferenceMAX/actions/runs/19213616451

@cquil11 cquil11 marked this pull request as ready for review November 9, 2025 20:05
@cquil11 cquil11 requested a review from a team as a code owner November 9, 2025 20:05
Copilot AI review requested due to automatic review settings November 9, 2025 20:05
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull Request Overview

This PR updates the mi355x runner configuration to temporarily use new runners while nodes 0-3 undergo maintenance. The changes replace the original four runners (mi355x-amd_0 through mi355x-amd_3) with two temporary runners (mi355x-amd_4 and mi355x-amd_5), which are configured as GPU3DBD and GPU3D32 respectively.

  • Commented out original mi355x runners (nodes 0-3) that are undergoing maintenance
  • Added two temporary replacement runners (nodes 4-5)

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

@cquil11 cquil11 changed the title update mi355x runners to add temp runners and remove nodes 0-3 chore: add temp mi355x runners and disable mi355x nodes 0-3 temporarily Nov 9, 2025
@cquil11 cquil11 merged commit 917372f into main Nov 10, 2025
27 of 54 checks passed
@cquil11 cquil11 deleted the mi355x-updates branch November 10, 2025 19:01
@cquil11 cquil11 restored the mi355x-updates branch November 11, 2025 23:07
@functionstackx functionstackx deleted the mi355x-updates branch December 4, 2025 21:31
@cquil11 cquil11 added the AMD label Apr 8, 2026
@cquil11 cquil11 changed the title chore: add temp mi355x runners and disable mi355x nodes 0-3 temporarily [AMD] chore: add temp mi355x runners and disable mi355x nodes 0-3 temporarily Apr 8, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants